Continuous emotion tracking using total variability space

نویسندگان

  • Hossein Khaki
  • Engin Erzin
چکیده

Automatic continuous emotion tracking (CET) has received increased attention with expected applications in medical, robotic, and human-machine interaction areas. The speech signal carries useful clues to estimate the affective state of the speaker. In this paper, we present Total Variability Space (TVS) for CET from speech data. TVS is a widely used framework in speaker and language recognition applications. In this study, we applied TVS as an unsupervised emotional feature extraction framework. Assuming a low temporal variation in the affective space, we discretize the continuous affective state and extract i-vectors. Experimental evaluations are performed on the CreativeIT dataset and fusion results with pool of statistical functions over mel frequency cepstral coefficients (MFCCs) show a 2% improvement for the emotion tracking from speech.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A quantitative investigation on lung tumor site on its motion tracking in radiotherapy with external surrogates

Introduction: In external beam radiotherapy each effort is done to deliver 3D dose distribution onto the tumor volume uniformly, while minimizing the dose to healthy organs at the same time. Radiation treatment of tumors located at thorax region such as lung and liver has a challenging issue during target localization since these tumors move mainly due to respiration. There are...

متن کامل

Factor Analysis Based Speaker Normalisation for Continuous Emotion Prediction

Speaker variability has been shown to be a significant confounding factor in speech based emotion classification systems and a number of speaker normalisation techniques have been proposed. However, speaker normalisation in systems that predict continuous multidimensional descriptions of emotion such as arousal and valence has not been explored. This paper investigates the effect of speaker var...

متن کامل

Expressive Gait Synthesis Using PCA and Gaussian Modeling

In this paper we analyze walking sequences of an actor performing walk under eleven different states of mind. These walk sequences captured with an inertial motion capture system are used as training data to model walk in a reduced dimension space through principal component analysis (PCA). In that reduced PC space, the variability of walk cycles for each emotion and the length of each cycle ar...

متن کامل

MediaEval 2015: A Segmentation-based Approach to Continuous Emotion Tracking

In this paper we approach the task of continuous music emotion recognition using unsupervised audio segmentation as a preparatory step. The MediaEval task requires predicting emotion of the song with a high time resolution of 2Hz. Though this resolution is necessary to find exact locations of emotional changes, we believe that those changes occur more sparsely. We suggest that using bigger time...

متن کامل

Deterministic and Stochastic Methods for Gaze Tracking in Real-Time

Psychological evidence demonstrates how eye gaze analysis is requested for human computer interaction endowed with emotion recognition capabilities. The existing proposals analyse eyelid and iris motion by using colour information and edge detectors, but eye movements are quite fast and difficult for precise and robust tracking. Instead, we propose to reduce the dimensionality of the image-data...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015